A Representation Framework for Cross-lingual/Interlingual Lexical Semantic Correspondences
Abstract
This paper proposes a framework for representing cross-lingual/interlingual lexical semantic correspondences, which are expected to be recovered through a series of on-demand/on-the-fly invocations of a lexical semantic matching process. A central notion of the framework is the pseudo synset, introduced to represent a cross-lingual/multilingual lexical concept that is jointly denoted by word senses in more than one language. Another important ingredient is a framework for semantifying bilingual lexical resource entries, a necessary substep when bilingual lexical resources are used to associate and represent corresponding lexical concepts in different languages. Based on these devices, the paper further discusses possible extensions to the ISO standard Lexical Markup Framework (LMF). These extensions would enable recovered correspondences to be organized as a dynamic secondary language resource, while keeping the existing primary language resources intact.
Similar resources
A Framework for Cross-lingual/Node-wise Alignment of Lexical-Semantic Resources
Given lexical-semantic resources in different languages, it is useful to establish cross-lingual correspondences, preferably with semantic relation labels, between the concept nodes in these resources. This paper presents a framework for enabling a cross-lingual/nodewise alignment of lexical-semantic resources, where cross-lingual correspondence candidates are first discovered and ranked, and t...
Interlinguas: Deep and Shallow
In 1629, Descartes proposed a “language of true philosophy” to serve as an interlingua for translating between languages. Over 300 years later, “semantic interlingua” appears at the top of the Vauquois triangle, as the deepest possible analysis guaranteeing the best possible translation. But the mainstream of machine translation has considered the interlingua unrealistic and worked on lower le...
Development of Cross-linguistic Syntactic and Semantic Parameters for Parsing and Generation
This document reports on research conducted at the University of Maryland for the Korean/English Machine Translation (MT) project. The translation approach adopted here is interlingual, i.e., a single underlying representation called Lexical Conceptual Structure (LCS) is used for both Korean and English. The primary focus of this investigation concerns the notion of 'parameterization', i.e., a mec...
Any-language frame-semantic parsing
We present a multilingual corpus of Wikipedia and Twitter texts annotated with FRAMENET 1.5 semantic frames in nine different languages, as well as a novel technique for weakly supervised cross-lingual frame-semantic parsing. Our approach only assumes the existence of linked, comparable source and target language corpora (e.g., Wikipedia) and a bilingual dictionary (e.g., Wiktionary or BABELNET...
Towards Best Practice for Multiword Expressions in Computational Lexicons
The importance and role of multi-word expressions (MWE) in the description and processing of natural language has long been recognized. However, multi-word information has often been relegated to the marginal role of idiosyncratic lexical information. The need for MWE lexicons grows even more acute for multi-lingual applications, for which (sometimes complex) correspondences must be identified,...